Doc Ref. AMI 
Appl. No. 09/754,256 



(19) 



(12) 



Europdisches Patentamt 
European Patent Office 
Office europ6en des brevets (i 1) EP 0 71 4 066 A2 

EUROPEAN PATENT APPLICATION 



(43) Date of publication: 

29.05.1996 Bulletin 1996/22 

(21) Application number: 95114510.1 

(22) Date of filing: 15.09.1995 



(51) Int. Cl.^: G06F 12/08, G06F 17/30 



(84) Designated Contracting States: 
DE FR GB 

(30) Priority: 23.11.1994 US 344325 

(71) Applicant: International Business Macfiines 
Corporation 

Armonic, N.Y. 10504 (US) 

(72) Inventors: 

• Chen, Ming-Syan 
Yorktown Heights, New York 10598 (US) 



• Wu, Kun-Lung 

Yorktown Heights, New York 10598 (US) 

* YU, Philip Shi-Lung 
Chappaqua, New York 10514 (US) 

(74) Representative: Schdfer, Wolfgang, Dipl.-lng. 
IBM Deutschland 
Informationssysteme GmbH 
Patentwesen und Urheberrecht 
D-70548 Stuttgart (DE) 



Fiai 



(54) Communications system and method including energy-efficient caching for mobile 
computing 

(57) A comnrrunlcations system and method include 
an efficient cache invalidation technique which allows a 
computer to relocate and to disconnect without informing 
the server The server partitions the entire database into 
a number of groups. The server also dynamically identi- 
fies recently updated objects in a group and excludes 
them from the group when checking the validity of the 
group. If these objects have already been included in the 
most recent invalidation broadcast, the remote computer 
can invalidate them in its cache before checking the 
group validity with the server. With the recently updated 
objects excluded from a group, the server can conclude 
that the cold objects in the group can be retained In the 
cache, and validate the rest of the group. 
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Description 

BACKGROUND OF THE INVENTION 

5 FIELD OF THE INVENTION 

The present Invention relates to communications systems and more particularly to communications systems includ- 
ing mobile computing systems. 

10 PRI OR ART 

In a mobile computing environment, a large number of battery-powered, portable machines can be used by many 
users to query information and database servers from various places through wireless communication channels. These 
nfK)bile computers may often be disconnected for prolonged periods of time in order to conserve the battery energy. 

15 They may also frequently move from one cell to another and connect to different data servers at different times, enabling 
unrestricted mobility of the users. Due to the unlimited mobility, users of mobile computers can satisfy their information 
needs whenever and wherever they want. 

Generally, the bandwidth of the wireless communication channel is very limited. Thus, caching of frequently used 
data in a wobWe computer can be an effective technique to reduce the wireless bandwidth requirement. Once caching 

20 is used, however, a cache validation strategy is needed to ensure the data cached in the mobile computer are coherent 
with those in the server. 

There are generally two different prior art approaches to cache invalidation, depending on whether or not the servers 
are aware of which data objects are being cached by which mobile conputers. In a first prior art approach, the server 
maintains the cache state of each mobile computer and is called a statefull server. Cache invalidation can be done via 

25 sending an invalidation message to the appropriate mobile computers whenever an object is updated in the server. The 
server must also keep track of the on-off status of each mobile computer. In case a mobile computer fails, the server 
may lose track of the cache state of the mobile computer. Thus, a mobile computer must inform the server whenever it 
wants to disconnect itself or whenever it wants to move from one cell to another cell, significantly restricting the freedom 
and mobility of the mobile user. 

30 In a second prior art approach, the server does not keep track of the cache state of each mobile computer and is 
called a stateless server. It does not even know which mobile computers are currently active. Cache invalidation is 
typically done via periodically broadcasting an invalidation report, which contains the IDs of objects that are most recently 
updated. Each mobile computer listens, if active, to the report and invalidates its cache contents accordingly. However, 
when a mobile computer is disconnected for energy-saving purpose, it may miss many invalidation broadcasts. As a 

35 result, when it wakes up, it may have to discard the entire cache contents because it does not know whether or not some 
of its cached objects have been updated since it went to sleep. Discarding the entire cache content after a disconnection 
is very costly because most of the benefits of caching to save bandwidth requirement are lost. It is also unnecessary to 
discard the entire cache content because most of the cached objects in the mobile computers may be updated rather 
infrequently and can be salvaged even after a prolonged disconnection. 

40 To salvage some of the cached objects after a disconnection, the mobile computer can check with the server. Even 
though the server is stateless, it can maintain the object update history. One simple way of validity checking is to send 
all the IDs of its cached objects to the server. This is called simple checking (SC). However, sinple checking requires a 
high communication overhead espedally if the cache size is large. 

To reduce the communication overhead, the database objects in the server can be partitioned into groups and a 

45 mobile computer can check the validity of a group, instead of each individual object. If the entire group has not been 
updated, the group can be salvaged. However, if any object within a group has been updated, the entire group has to 
be discarded from the cache. This is called simple grouping (SG). In SG, if the group size is large, discarding the entire 
group can also be costly and unnecessary. 

Each of the prior art caching techniques identified above have disadvantages which reduce system efficiency as a 

50 result of large communications overhead required to update the cache of a remote computer. 

SUMMARY OF THE INVENTION 

Therefore, it is an object of the present invention to allow a computer to move from one cell to another without 
55 informing a server and to disconnect from the server for saving battery energy but, when it wakes up. still retain most 
of its cached storage. 

In accordance with a preferred embodiment of the present invention a communications system and metiiod including 
an efficient cache invalidation technique allows a computer to relocate and to disconnect witiiout informing the server. 
The server partitions the entire datat>ase into a number of groups. The server also dynamically identifies recently updated 
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objects in a group and excludes them from the group when checking the validity of the group, ff these objects have 
already been included in the most recent invalidation broadcast, the remote computer can invalidate them in its cache 
before checking the group validity with the server. With the recently updated objects excluded from a group, the server 
can conclude that the cold objects in the group can be retained in the cache, and validate the rest of the group. This 
technique is referred to as Grouping with COW update set REtention (GCORE). GCORE is energy efficient because it 
allows a mobile computer to disconnect at any time to save energy and still retains most of the cache contents if they 
have not been updated and thus are considered to be cold update set Moreover, it does not incur a high communication 
overhead by using grouping of data. 

In a preferred embodiment, GCORE is used to support energy-efficient caching in a remote computing system. 
Database objects are partitioned into a small number of groups. To retain the cokJ objects in a group that have not been 
updated, the server maintains for each group an object update history of the past IVbroadcast intervals, consisting of 
a list of object IDs and their most recent update times, and the most recent update time of the group. Periodically, the 
server broadcasts an invalidation report containing the last broadcast intervals of object update history. If a remote 
computer is active, it invalidates its cache entries based on the invalidation report. In case this broadcast report is the 
first one received since the remote computer has woken up, it also sends to the server an uplink message, containing 
the group ids and the timestamp of the latest broadcast report received before it became disconnected. The server 
checks the validity of a group by examining whether or not all the objects updated since the mobile computer becomes 
disconnected have been included in the latest broadcast report. After receiving the validity report from the server, the 
just woken-up remote computer then processes its queries initiated since it has woken up. Thus, in GCORE cache 
invalidation is accomplished by an asymmetrical approach where a server periodically broadcasts invalidation reports 
and a remote computer can check its cache validity after disconnection and reconnection. 

The foregoing has outlined rather broadly the features and technical advantages of the present invention in order 
that the detailed desaiption of the invention that follows may be better understood. Additional features and advantages 
of the invention will be described hereinafter which form the subject of the claims of the invention. 

BRIEF DESCRIPTION OF THE DRAWING 

For a more complete understanding of the present invention, and the advantages thereof, reference is now made 
to the following descriptions taken in conjunction with the acconrpanying drawings, in which: 

FIGURE 1 is a block diagram of a communications system embodying the present invention; 

FIGURE 2 is a flow diagram of a process for maintaining a list of data objects updated since last broadcast; 

FIGURE 3 is a flow diagram of a process for maintaining group update history; 

FIGURE 4 is a flow diagram of a process for fast checking of group validity; 

FIGURE 5 i*:, a flow diagram of a process for preparing a new broadcast report; 

FIGURE 6 is a flow diagram of query processing by a remote computer; and 

FIGURE 7 is a flow diagram of grojp validity checking by a server. 

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT OF THE INVENTIOM 

Referring now to FIGURE 1 . a block diagram of a communications system according to a preferred en^odiment of 
the present invention will be described. 

In the following description, it is assumed that database objects are maintained on the servers 2 which are connected 
by a communication network 3. It is also assumed that the database is rqDiicated on each mobile server 2. 7>ius, a 
remote computer can query the database from any wireless radio cell. Remote corrputers 1 can connect to one of the 
servers 2 by a wireless communication channel. For the purpose of cache invalidation, when a computer 1 relocates 
from one wireless radio cell 12, 14. 16, 18 to another cell 14. 16. 18. 12. it need not notify any of the servers 2. Queries 
are executed on the remote computers 1 while updates to tiie databases are only executed on the servers 2. In order 
to process queries with the more up-to-date data, queries are batched and processed right after a remote computer 1 
receives a new broadcast report. Once referenced by queries, database objects are cached in the remote computers' 
1 nonvolatile memory, such as hard disks (not shown) in the remote computers 1. Periodically, servers 2 broadcast 
invalidation reports containing the object IDs and tiieir most recent update times, H active, a remote computer 1 then 
invalidates its local cache according to the invalidation report. In order to retain the benefits of caching while allowing 
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remote computers 1 to disconnect (power off) or relocate at will, the servers 2 inplement a cache invalidation technique 
called grouping with cold update set retention (GCORE) 4. 

A server 2 broadcasts an invalidation report every L units of time. Namely, invalidation reports are broadcast at 
T- 2L T- L and T To implement GCORE. three important data structures are maintained by the servers 2. The first 
one. curlisl contains a list of pairs of (oid. time) for each distinct object updated since the last invalidation broadcast, 
where oid is the object identifier and time is the object update time. 



struct pair 
{ 

int oid; /* object identifier */ 
double time; /* object update time */ 

) 

struct pair *curlist; /* a pointer to a list of pairs */ 



The second one, bcast_report contains the broadcast time and a list of pairs of objects updated in the most recent w 
broadcast intervals. 

struct bcast_report 
I 

double time; /* broadcast time */ 

struct pair *uplist; /* a pointer to a fist of pairs */ 

} 

The third one, grouptbl, contains an an-ay of group entries and is used to maintain the update history for each group. 
The server 2 maintains the update history for each group for the most recent broadcast intervals, where Wsw. 
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struct group„entry 

5 

{ 

double time; /* most recent update time for a group */ 
int tot_wW; /* total number of distinct objects most 

10 

recently*/ */ updated between T-WL and T-wL */ 

struct pair ♦uplist; /* a pointer to a list of pairs */ 

} 

15 

struct group_entry grouptbl [ ] ; 



20 

The grouping function can be as simple as a modulo function where the number of objects in each group is about the 
same. On the other hand, the grouping function can also be different for different types of objects, resulting in different 
group sizes. Whatever grouping function is chosen, it must be agreed upon between the servers 2 and the remote 
computers 1. ft is assumed that the latest broadcast time (latest_bcastjime) is kept in the nonvolatile memory (not 

25 shown) of each remote computer 1 so that after a remote computer 1 wakes up it can obtain this information. 

For every object updated, the server 2 executes two processes: insert_curlistm6 insert jgrouptbl FIGURE 2 shows 
the flow chart for insert_curlist. When object A is being updated 210, the curlist is checked to see if there is a pair that 
has the same object identifier as A 211. ff yes, then the update time of this pair is changed to the current time 212. 
Othenvise. a new pair is created containing the object ID A and the current update time and is inserted at the top of 

30 curlist 213. 

FIGURE 3 shows thef low chart for insert_grouptbL When an object, for example A, is updated. 320, the con-espond- 
ing group ID of object A is first computed 321 . If such group ID is gid. then grouptblfeidl.time is changed to the current 
time. Next, all the pairs pointed to by grouptbl[gid].uplist are checked to see if any of them has the same object identifier 
as ot^ect A 322. If yes, then its corresponding update time is changed to the current time 324. Othenvise, a new pair is 
35 created and inserted at the top of grouptbI[gidJ.uplist for object A 323. 

Every L units of time, the server 2 broadcasts an invalidation report and updates the grouptbl accordingly. FIGURE 
4 shows the flow chart of advance^grouptbl. For each grouptbl entry i, the server removes the pairs pointed to by 
grouplblH.uplist that have update times less than T- WL, where 7 is the cun-ent broadcast time 430. Also, for each 
grouptblp] the total number of pairs that have update times between T - WLand T - wL\s recomputed for the new 
40 broadcast time 7431 . The recomputation of grouptbl[i].tot_wW is for fast checking of group validity. 

FIGURE 5 shows the flow chart of preparing the new broadcast report, new_jeport. First, ail the pairs pointed to by 
bcast_report.uplist that have update times less than T - wL are removed 540. After that, any pair pointed to by 
bcast_reportuplist that has the same object ID as that in curlist is also removed 541 . Then, the entire list pointed to by 
curlist is inserted at the top of bcast_report.uplist. curlist is reset to NULL, and bcast_report.time is set to the current 
45 broadcast time 542. By doing so, the bcast_report will contain only the distinct object pairs that are updated in the most 
recent wbroadcast intervals. 

For a remote computer 1 to process queries with up-to<Jate data, all the queries are batched until it receives a new 
broadcast report. 

FIGURE 6 shows the flow chart of query processing by a remote con^uter 1 . Upon receiving a new bcast_report, 
so the computer 1 first invalidates its cache contents based on this report 650. If this computer 1 has been disconnected 
and this is the first beast-report it receives after wake up and latest_bcast_time < bcast_report.time 651 , it then sends 
its latest_bcast_time and its group IDs to the server 2 tor validity checking 652. It then waits for the validity results from 
the server 2 arxJ invalidates its cache 653. Then the computer 1 processes all the queries 654. If objects cannot be 
found in the local cache (not shown), it sends requests to the server 254. 
55 For group validity checking, the server executes a simple process. FIGURE 7 shows the flow chart of this validity- 
checking process. For each grouptbl[0 760. the server 2 checks if grouptblfij.time < latest_bcast_time 761. If yes. then 
grouptblpj is valid 763. If not, then it further checks if grouptbl[i].tot_wW = 0 and grouptbllO-time >(T-WL) 762. If yes. 
then grouptblp] is also valid 763. OthenMse, grouptblp] is invalid 764. 
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Although the present invention and its advantages have been described in detail, it should be understood that various 
changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention 
as defined by the appended claims. 

Claims 

1 . A method of reducing overhead in a communications system having one or more remote units in communications 
with one or more servers, where said remote units are adapted to operate in a disconnected mode for a period of 
time, comprising the steps of: 

combining data objects stored in said one or more servers into one or more groups; 
broadcasting invalidation reports by one or more of said servers to one or more remote computers; 
dynamically excluding by said one or more servers one or more most recentiy updated objects from a group;and 
efficiently checking at said one or more sen/ers a group validity without object by object comparisons. 

2. A method of redudng overhead in a communications system according to Claim 1 » wherein 

said broadcasting step further comprises periodically broadcasting by one of said servers to said one or more remote 
conputers an invalidation report containing one or more objects updated during a most recent plurality of broadcast 
intervals. 

3. A method of reducing overhead in a communications system according to Claim 1 or 2. further comprising the step of: 
invalidating contents of caches in said one or more remote computers based on information contained in said inval- 
idation reports. 

4. A method of reducing overhead in a communications system according to one of claims 1 to 3, furtiier comprising 
the step of: 

maintaining at said one or more servers a latest ipdate time of a group and a number of objects in a group which 
are updated during a predetermined time interval. 

5. A metiiod of reducing overhead in a communications system according to one of claims 1 to 4, further comprising 
the steps of: 

waiting for a next invalidation broadcast to invalidate cache entries in one of said remote conputers; and 
sending a time identifier of a last invalidation report received by said remote computer before a most recent discon- 
nection of said remote computer 

6. A method of reducing overhead in a communications system according to one of claims 1 to 5. furttier comprising 
the step of: 

determining at said one or more servers a validity of a group which excludes one or more objects updated during 
a predetermined broadcast interval. 

7. A data communications system comprising: 
a communications network; 

one or more servers connected to said communications network; and 

one or more remote computers adapted to operate in a disconnected mode for a period of time, and adapted to 
operate in communication with said one or more servers, each of said remote computers comprising means respon- 
sive to a broadcast invalidation report fi^om one of said servers for invalidating cache contents identified in said 
invalidation report. 

8. A data communications system according to Claim 7 further comprising: 

means in said one or more servers for dynamically excluding one or more most recently updated objects from a 
group of objects; and 

means for efflcientiy checking by said one or more servers a group valicfity without object by object comparisons. 

9. A data communications system according to Claim 7 or 8, further comprising: 

means for maintaining at said one or more servers a latest update time of each group of objects and a number of 
objects in each said group that have been updated during a predetermined time interval. 
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(57) A communications system and method include 
an efficient cache invalidation technique which allows a 
computer to relocate and to disconnect without Inform- 
ing the server. The server partitions the entire database 
into a number of groups. The server also dynamically 
identifies recently updated objects in a group and 
excludes them from the group when checking the valid- 
ity of the group. If these objects have already been 
included in the most recent invalidation broadcast, the 
remote computer can invalidate them in its cache before 
checking the group validity with the server. With the 
recently updated objects excluded from a group, the 
server can conclude that the cold objects in the group 
can be retained in the cache, and validate the rest of the 
group. 



nai 




Primed by Rank Xerax (UK) Business Services 
2.i4.7/a4 



EP 0 714 066 A3 



European Patent 
Office 



EUROPEAN SEARCH REPORT 



Application Number 

EP 95 11 4510 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Catccory 



CitaCioD of docament with iodicadon, wbere appropriate, 
of rekvapt passages 



Relevant 
to daiiD 



CLASSIFICATION OF THE 
APPUCATION antCI.6) 



SIGMOD RECORD, 

vol. 23, no. 2, 1 June 1994, 
pages 1-12, XP000454444 
BARBARA D ET AL: "SLEEPERS AND 
WORKAHOLICS: CACHING STRATEGIES IN MOBILE 
ENVIRONMENTS- 

* page 3, left-hand column, line 21 - page 
4, right-hand column, line 6 * 

USENIX SUMMER CONFERENCE PROCEEDINGS, 
6 June 1994, 
pages 279-289. XPO0057549O 
MUMMERT L ET AL: "LARGE GRANULARITY CACHE 
COHERENCE FOR INTERMITTENT CONNECTIVITY" 

* page 279 - page 281. right-hand column, 
line 36 * 

ACM TRANSACTIONS ON COMPUTER SYSTEMS, 
vol. 10, no. 1, 1 February 1992, 
pages 3-25, XP000323223 
KISTLER J J ET AL: "DISCONNECTED 
OPERATION IN THE CODA FILE SYSTEM" 

* page 10 - page 17 * 



G06F12/08 
G06F17/30 



1-6,8,9 



1-5,7-9 



1-9 



TECHNICAL FIELDS 
SEARCHED antCL6) 



G06F 



Tbe present search report has been drawn up for all dains 



THE HAGUE 



Dale af cawylrtto of tkc tcvtfc 

15 April 1997 



Nielsen, 0 



CATEGORY OF aTED DOCUMENTS 

X : partiaiUriy rdcvani if taken alon* 

Y : partidilaHy rdevant if combined with another 

document of the sane categoiy 
A : techaolo^cal backgnnmd 
O : Don-writtcn disdostue 
P : iotenscdlate document 



T : thewy or priadple uadcrtying the invention 
E : earlier patent document, but poUished on, or 

after the filing date 
D : document dted ia the application 
L : documest dted for other reasons 



A : member of tbc same patent family, corresponding 
docomait 



2 



